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ABSTRACT 

Acoustic studies have shown that phonetic context 
can have substantial effects on the cues associated with a given 
speech sound* The present study investigates whether or not 
modifications in the acoustic correlates of initial stops and 
fricatives due to the following vowel can affect phonemic decision 
processes* In the first of two experiments, C-v syllables comprised 
of a stop plus a vowel were paired and presented to 36 first-grade 
subjects in a discrimination task; in the second experiment, 
fricatives were involved instead of stops* The results for Experiment 
I showed that subjects discriminated the stops significantly better 
in long vowel contexts than in short vowel contexts* Results for 
Experiment II showed that discriminations of place contrasts 
involving /s/ or /z/ as well as the homorganic voicing contrasts were 
not subject to differential vowel effects. Discrimination of /f/ from 
voiceless /th/ and /v/ from voiced /th/ # however, were significantly 
better in back vowel contexts than in front vowel contexts* 
Discriminations of /f/ from voiceless /th/ and /v/ from voiced /th/ 
were found to be significantly more difficult than the 
discriminations of the other fricative contrasts. Results show that 
effects of coarticulation affect discrimination probabilities* These 
findings question theories of one-to-one correspondence between the 
acoustic segment and the sound perceived* (Author/AMN) 
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STATEMENT OF FOCUS 



The Wisconsin Research and Development Center for Cognitive Learning 
focuses on contributing to a better understanding of cognitive learning 
by children and youth and to the improvement of related educational prac- 
tices. The strategy for research and development is comprehensive. It 
includes basic research to generate new knowledge about the conditions 
and processes of learning and about the processes of instruction, and 
the subsequent development of research-based instructional materials, 
many of which are designed for use by teachers and others for use by 
students. These materials ?re tested ard refined in school settings. 
Throughout these operations behavioral scientists, curriculum experts, 
academic scholars, and school people interact, insuring that the results 
of Center activities are based soundly on knowledge of subject matter 
and cognitive learning and that they are applied to the improvement of 
educational practice. 

This Technical report is from the Basic Pre-Reading Skills: Identi- 

fication and Improvement Project in Program l v General objectives of the 
Program are to generate new knowledge about concept learning and cognitive 
skills, to synthesize existing knowledge, and to develop educational mater- 
ials suggested by prior activities. Contributing to these Program object- 
ives, this projects basic goal is to determine the processes by which 
children aged 4 to 7 learn to read, examining the development of related 
cognitive and language skills, and to identify the specific reasons why 
many children fail to learn to read. Later studies will be conducted to 
find experimental techniques and tests for optimizing the acquisition of 
skills needed for learning to read. By-products of this research program 
include methodological innovations in testing paradigms and measurement 
procedures; the present study is an example. 
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Abstract 



Acoustic studies have shown that phonetic context can have substan- 
tial effects on the cues associated with a given speech sound. The pre- 
sent study investigates whether or not modifications in the acoustic 
correlates of initial stops and fricatives due to the following vowel 
can affect phonemic decision processes. 

Two experiments were conducted to investigate contextual effects. 

In experiment I, C-V syllables comprised of a stop plus a vowel were 
paired and presented to 36 first grade Ss in a discrimination task. 

An A-B-X paradigm was employed, using a stereo ‘tape recorder with two 
speakers. Nine minimal place and voicing contrasts involving the stops 
/p/, /b/, /t /, /d/, /k/, /g/ were presented, each in the context of 
eight different vowels. Vowels could be grouped into high and low, long 
and short, and front and non-front for analysis. 

Experiment II differed frou Experiment I only in that it involved 
fricatives instead of stops. The six fricatives / f / , /v/, / 9/ , /#/ , /s/ 
/z/ were employed in making up the nine contrasts. 

The results for Experiment I showed that Ss discriminated the stops 
significantly better in long vowel contexts than in short vowel contexts 
The discrimination rates for each contrast, collapsed over all vowels, 
did not differ from one another. 

For Experiment II, the results indicated that discriminations of 
place contracts involving /s/ or /z/ as well as the homorganic voic- 
ing contrasts were not subject to differential vowel effects. Discrim- 
ination of /f/ from /0/ and /v/ from /j/, however, were significantly 
better in back vowel contexts than in front vowel contexts. Discrimin- 
inations of /f / from /€/ and /v/ from /gr/ were found to be significantly 
more difficult than the discriminations of the other fricative contrasts 

The results show that effects of coarticulation do affect discrim- 
ination probabilities. These findings call into question theories that 
propose any one-to-one correspondence between the acoustic segment and 
the sound perceived. 
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Chapter I 



INTRODUCTION 

Experimental phoneticians have devised a number of strategies 
to determine the cues in the acoustic signal on which the listener 
relies for phoneme recognition. A factor which most of these 
strategies have in common is the identification of isolated nonsense 
syllables. This forces the listener to make his response to the 
physical stimulus alone. Semantic, syntactic, and suprasegmental 
clues are eliminated. An increase in errors can be induced by 
various methods of signal at tenuat ion, , masking , or filtering. 

But responses elicited in noise or under conditions of filtering 
have inherent drawbacks, since may of the important questions 
concerning phoneme recognition relate to normal processing. In the 
present study, an attempt is made to investigate some of the' 
variables related to phonemic decision processes by studying 
children's performance on a consonant discrimination task. Specifically, 
the purpose of the study ic to ascertain the effects of varying the 
vowel on the child's ability to discriminate minimal consonant 
contrasts. 
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The acoustic correlates of consonant production are modified 
due to the influence of adjacent sound segments* This has been 
documented by several acoustic studies (e.g*, see ohman, 1963; 

1966), Thus, the acoustic correlates of any single consonant 
are subject to wide variation* 

Furthermore, it has been shorn thst the cues which lead the 
listener to a decision about a single phoneme are provided by 
several successive segments in the acoustic signal. (For a thorough 
review of the studies related to this question, see Kozhevnikov 
and Chistovich (1965))* On the basis of this finding, several 
investigators have concluded that the minimal acoustic unit by 
which phonemic deci^ons are made is the syllable (Kozhevnikov and 
Chistovich, 1965; Lyublinskaya, 1966; Liberman, Cooper, Shcnkweiler, 
and Studdert-Kennedy , 1967; Bondarko, 1969)* 

On the other hand, the authors of distinctive feature theory, 
while aware of the contextual modification of phoneme cues, did 
not feel compelled to propose a decision unit larger than the sound 
segment Itself (Halle, 1956; Jakobaon and Halle, 1956)* These 
investigators have proposed that each phoneme is represented by a 
unique bundle of features, which serve to distinguish a given 
phoneme from all others. The features which specify a phoneme 
are present in the signal in whatever context the phoneme occurs, 
and it is on the basis of these features that a phonemic decision is 
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Strong objections have been raised to this aspect of tha theory. 
Fry (1956) points out that if distinctive feature theory actually 
proposes a one-to-one relationship between the acoustic segment 
and the phoneme perceived, then the theory is difficult to reconcile 
with experimental results which indicate the diverse acoustic 
clues which lead the listener to the same phonemic decision (p. 170). 
Bush (1964) conducted an acoustic study in which she tested two of 
the feature oppositions and found that under the influence of certain 
contexts the oppositions were neutralized or even reversed. This 
criticism strikes at the proposed invariance of the features. 

Bush saw that the authors of the theory were aware of the wide 
va* ations that occur in the acoustic correlates of a phoneme because 
of context. However, in her words, "Their assumption is that all 
such modifications lie well within the acoustic specification of the 
distinctive features and the distinctive feature specification of 
the phoneme (Jakobson and Halle, 1956; Lotz, 1950)." It is important 
to note that Bush did not feel that her results justify any 
rejection of distinctive feature theory. She merely felt that the 
theory needed to be reconciled with her findings. 

Recently an attempt was made to reconcile distinctive feature 
theory with the comments of Fry and the findings of Bush- Bondarko 
(1969) felt that the theory designating the syllable as the 
minimal decision unit was correct, and, at the same time, that 
distinctive feature theory is "without doubt the most economic 
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and systematic description of phonemes, . . . (p. 1)" Thus, he 
proposed that the realization of the distinctive features be 
specified in terras of the whole syllable* He based this proposal 
on the fact that it is precisely the coarticulation of sounds in 
the syllable that determine the nature of the allophone realized. 

However helpful the acoustic and articulatory research can 
be in refining a theory like that concerned with distinctive 

features. The theory also relates to perception. It seems 
imperative to show that the acoustic effects of coarticulation 
do, in fact, affect phonemic decision processing. 

Thus, this study undertakes to show that the child’s performance 
on a consonant discrimination task will be differentially affected by 
the phonetic context in which the consonant contrast is presented. 
Since it has been shown that the effect of phonetic environment on 
the acoustic parameters associated with a given consonant can be 
substantial* 1 % should be the case that certain environments will 
have a more favorable effect on ths phonemic decision than other 
environments. 
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Chapter II 

BACKGROUND OF THE PROBLEM 



ACOUSTIC PROPERTIES OF STOPS AND FRICATIVES IN CV SY LIABLE S 

In this section, the acoustic cues related to the discrimination 
cf one stop from another and one fricative from another in CV syllables 

will be discussed on the basis of research previously reported in the 
literature, Wherever possible, any influence which varying vowel 
context may have on these cues will be considered. A survey of what 
is known about the acoustic parameters that come into play in recog- 
nition or discrimination tasks permits some general predictions 
concerning the outcome of the proposed experiments 

Stops 

Stop or plosive consonant sounds are produced when the articu- 
lators close off the airstream at some point in the vocal tract, 
causing a build-up of pressure behind the closure. The release of 
the air built up behind the closure causes a transient "burst" of 
noise which is the result of turbulence in the airstream at the point 

of release. At the moment xn time when the release occurs, the articu- 
lators are moving into the configuration appropriate for the next 
sound segment. These rapid movements of the articulators are reflected 
In the acoustic signal as frequency shifts of the major resonances 
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and are called transitions. Figure 1 is a stylized acoustic represen- 
tation of CV syllables, one (a) Initiated by a voiced stop and another 
(b) initiated by a voiceless stop. 

The longer duration of noise characteristic, of the voiceless 
plosive represents the burst plus the aspiration which typically is 
present after the release of this class of stops in American English^ 
The dark line evident throughout the closure duration of the voiced 
stop represents the feature of voicing which has its onset typically 
during the closure of this class of stops. 

English stops are distinguished according to the point in the 
vocal tract where closure occurs. English speakers produce stops 
at 3 places of articulation; corresponding voiced and voiceless 
cognates are present at each place of articulation. 





bilabial . 


alveolar 


velar 


voiceless 


P 


t 


k 


voiced 


b 


d 


g 


Frequency. Intensity 


. and Durational 


Characteristics 


of the 



NOISE Portion . The burst of noise associated with the release of a 
particular stop consonant has «* major concentration of energy centered 
in a certain area of the frequency spectrum. English /p/, / t/, and 
/k/ and their voiced cognates / b/, / d / , / g/ have been the subject of 
acoustic research with regard to this feature. Halle, Hughes, and * 
Radley (1957) presented the following generalizations about the 
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Silence 



Fig. 1. 



Time (msec) 



— ► 

Sylized acoustic representation of stop + vowel syllables according 
to the dimensions of frequency chsracterist ics over time. In time, 
the sequence of articulatory events is as follows: Closure 

duration, release, transition, vocalic steady state (from Slis and 
Cohen 1969)* 
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spectral properties of the bursts associated with the three classes 
of stops: 

/p/ and /b/, the bilabial stops, have a primary concentration 
of energy in the low frequencies (500-1500 cps,), 

/ t/ and /d/ # the alveolar stops, have either a flat spectrum 
or one in which the higher frequencies (above 4000 cps.) 
predominate . 

/k/ and /g/, the palatal and velar stops, show strong concentra- 
tions of energy in the intermediate frequency regions (1500- 
4000 cps .) . 

These generalizations were confirmed in research using synthetic 
speech stimuli by several other investigators (Liberman, Delattre, & 
Cooper, 1952; Ainsworth, 1968). On the other hand, it has also been 
shown that a great deal of variation exists in the burst frequency 
associated with each stop, depending on the following vowel (Liberman, 
et al. , 1952; Fischer-J^rgensen , 1954). 

While the burst frequency of a stop can serve as a cue for the 
identification of that stop, several studies indicate that accurate 
perceptions of stop sounds can be obtained even without the burst cue 
present (Liberman, et al. , 1952; Halle, et al. , 1957; Grimm, 1964). 
This should not be interpreted as an assertion that the frequency of 
the noise burst is not utilized in perception, only that it does not 
seem to be essential. Halle, et al. (1957) showed that, with training, 
fairly accurate perception of stops can be achieved in response to 
isolated noise bursts. Minifie, Rudegeair, Milstein, and Vivion (in 




preparation) spliced out initial transition portions of taped CVC 
syllables and found that accurate stop recognition was still possible 
100% of the time. 

Since it is known that the nature of the following vowel does 
affect the frequency of the burst, and that this in turn may affect 
discrimination performance, it is appropriate to consider what the 
outcome of such an effect might be in terms of the present study. 

With regard to the place contrasts, / p/ vs. / t / , / p/ vs. /k/ , 
and / t/ vs. /k/, as well as the corresponding contrasts among the 
voiced stops, differential discrimination rates may be found between 
vowel environments which render tho contrasting bursts more or less 
similar. A clear example of this might occur with the /p/ vs. /k/ 
contrast. Before back vowels, the /k/ burst has in common with the 
/p / burst a rather low-f requeue^ concentration of energy. Before 
front vowels, however, as /k/ becomes more palatal rather than velar, 
the energy concentration associated with the burst occurs at a higher 
frequency, potentially rendering / k/ less confusable with / p/ . 

The intensity of the burst is known to vary according to whether 
the stop is voiced or voiceless (Halle, et al. > 1957), but no differ- 
ential vowel effects are predicted on the basis of this finding, at 
least not in regard to the homorganic voicing contrasts. It is 
possible that certain vowels may influence the intensity levels of 
consonantal bursts within voiced and voiceless classes, thus creating 
varying intensity differences among the place contrasts. Wang 
and Fillmore (1961) hypothesized that they would find better 



